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What Is Claimed Is: 



1. A database comprising a plurality of records, said records containing 
phenotype information and optionally sample information for an individual, 
wherein the! record for the individual further comprises confounding 
information, Wnd the sample information for the individual comprises 
information renting to the location of a sample of tissue or of fluid from the 
individual. 

2. A database according to Claim 1, wherein the record for an individual 
comprises information relating to a plurality of phenotypes and the record 
comprises, in respect of each phenotype :- 

the phenotype observed; and 

information relating to actual or potential confounding indicators 
in respect of phenotypdt. 



A database 



iccoramg to Claim 1, wherein said confounding information 



is selected from information selected from the group consisting of medication 
being taken by tfy£ individual, medical history, occupational information, 
information relating to the hobbies of the individual, diet information, family 
history, normal exercise routines of the individual, age and sex. 



4. A database according\ to Claim 1, wherein the phenotype and 
confounding information is collected at the same time from the individual. 



5. A database according to Ckim 1, comprising a plurality of records, each 
record containing genotype information, and optionally sample information for 
an individual, wherein: 



# 
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the phenotype information for the individual comprises at least one of 
and optionally all of osteoporosis related phenotypes, osteoarthritis related 
phenotypes, immune cell subtypes (such as Tcell subsets), metabolic 
syndrome/syndn^ne X related phenotypes, and hypertension related phenotypes; 
and 

the sample information for individual comprises information relating to 
the location of a samfele of tissue or of fluid from the individual. 



6. A database according to Claim 5, wherein the phenotype information 
further comprises at ledst one of and optionally all of thrombosis/fibrinolysis 
phenotypes, haemoglobmopathy related phenotypes and airways disease 
(asthma) phenotype. 

7. A database accordirk to Claim 6, wherein the phenotype information 
further comprises information relating to one or more of the phenotypes: 
atopy/eczema, lung functiop^E, psoriasis, acne, skin cancer and moliness of 
skin. 



8. A database according to\ Claim 1 comprising a plurality of records for 
human individuals. 

9. A database according to Cliim 1 wherein the sample of tissue or of fluid 
is selected from the group consisting of urine, serum, skin, liver, heart, bone, 
hair, muscle, kidney, tooth, saliva, faeces and DNA. 

10. A database according to Cklim 1 wherein the sample information 
comprises the geographical location of the sample, the storage conditions of the 
sample and the storage reference numbel for reference label of the sample. 
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11. database according to Claim 10 wherein the sample information 
additionally comprises contact information enabling the individual to be 
contacted and retested in person. 

12. A database according to Claim 1, wherein each record further includes 
genotype inf<Wation for the individual comprising one or more single 
nucleotide polymorphisms. 

13. A database according to Claim 1, comprising genotype information 
selected from one or more of: 

(i) \ actual or inferred DNA base sequence at one or more 
regions within the genome; 

(ii) a record of variation between a specified sequence on a 
chromosome of that individual compared to a reference sequence; and 

(iii) lerigth of a particular sequence or a particular sequence 
variant. \ 

14. A method of a^dijagrWormation to a database according to Claim 1 
comprising: / \ * 

(1 ) identifying an ihdividukl not yet included in the database; 

determining phenotype information for the individual; 

determining confounding information in respect of that phenotype 
information for the individual; \ 

optionally determining geAotype information for the individual; 

optionally determining simple information for the individual that 
includes information relating to thi location of the sample of tissue or of fluid 
from the individual; and \ 

creating a record in the databkse to hold the phenotype, confounding and 
optionally genotype and/or sample information for the individual; 
or \ 
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(2) identifying an individual already included in a record in the database; 

usiAg sample information in the database to obtain a tissue or fluid 
sample for the individual; 

testing the sample, thereby determining genotype or phenotype 
information for the individual; and 

adding 6r confirming or amending or updating information in the record 
for the individual 



10 



15 



15. A method \of identifying a correlation between phenotype information 
and genotype information comprising: 

selecting a phenotype characteristic; 

identifying a Wurality of records from the database of Claim 1 for 
individuals that comply with the selected phenotype characteristic; and 

taking account <k the confounding information, determining if presence 
of the selected phenotybe characteristic is correlated with presence of any 
genotype characteristic in the genotype information for records in the database. 
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16. A method of i 
information and second 
selecting a first 
identifying a p 
individuals who comply 
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itify^ng a correlation between first phenotype 
information comprising: 
Laracteristic; 

of records in the database of Claim 1 for 
ith thfc first phenotype information; 
determining if presence df the selected first phenotype is correlated with 
second phenotype information of iecords in the database. 

1 7. A method of identifying a correlation between genotype information and 
genotype information comprising: 

selecting a genotype characteristic; 

identifying a plurality of records in the database of claim 1 for 
individuals who comply with the genotype characteristic; 
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detdrmining if presence of the selected genotype characteristic is 
correlated \^ith another characteristic of genotype information or records in the 
database. 

18. A method of allocating priority to a candidate gene or locus, proposed as 
a drug target foV treatment of a disease, the method comprising:- 

calculatW from data on a database according to Claim 1, the specificity 
of the candidate gene or locus for the disease; 

comparing (i) the association of the disease with clinical risk traits 
related to the disease, to (ii) the association of the disease with other clinical risk 
traits unrelated to the disease, but representing significant side effects; and 

hence calculating a likely therapeutic index of drug candidates acting on 
that gene or locus. 



19. A method of .walyWg the relation between a genotype and a phenotype, 



/ 

comprising 

selecting a phen<! 



•type 3 



sharacteristic; 

identifying k plvJjrality of records in a database according to Claim 1 
complying with that characteristic; 

using environments and age-related data in the database to eliminate the 
effects of age and environnteit on variations in phenotype; and 

hence calculating frdm the database whether and if so to what extent the 
phenotype is correlated with k particular genotype. 



20. A method of determining the capacity and specificity of a genetic marker 
to detect and quantify normal variations in healthy and affected populations for a 
selected risk trait, comprising: 

assaying samples in a dktabase according to Claim 1 for the marker 
levels, in both healthy and affected! subjects; and 
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quantifying the association of the clinical trait with the marker level and 
other selected phWiotypes, in unaffected and affected subjects. 

21. A method \ of devising dose regimes and/or dose forms and/or drug 
delivery systems folr a given drug in a clinical trial, comprising:- 

selecting a proposed clinical population for the trial; 

using data oft a database according to Claim 1 to stratify the clinical 
population by high associations of metabolism or absorption of the drug both 
with genotype and/or With associated biochemical and cell biology phenotypes; 
and 

hence allowing c^finition of the best dose regimes and dose forms/drug 
delivery systems; 

so as to predict anWor allow for absorption and/or metabolism of the 
drug by patients in the clinickl population. 



22. A method of predidtiW response to a proposed drug therapy, 
comprising :- 

; ordiAg tb Claim 1 to select a clinical population by 
constructing haplotypic profiles, WithWong associations with defined clinical 
traits and biochemical phenotypes;! 

using the database to eliminate the effects of age and environment in the 
clinical population; 

hence providing criteria to predict response to the drug and variation in 
response to the drug, and optionallV to define a sub-group of the clinical 
population or of the general population most susceptible to the drug being 
studied. 



23. A method of correlating genotVpe and phenotype information with 
account taken of potential or actual confolmding information, comprising use of 
the database of Claim 1 . \ 
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24. A method of diagf 
individual not showing si| 
database of Claim 1 



sing disease or predisposition to disease in an 
cant signs of disease, comprising use of the 
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